To appear in: Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches
نویسندگان
چکیده
Whenever an intelligent agent learns to control an unknown environment, two opposing objectives have to be combined. On the one hand, the environment must be su ciently explored in order to identify a (sub-) optimal controller. For instance, a robot facing an unknown environment has to spend time moving around and acquiring knowledge. On the other hand, the environment must also be exploited during learning, i.e., experience made during learning must also be considered for action selection, if one is interested in minimizing costs of learning. For example, although a robot has to explore its environment, it should avoid collisions with obstacles once it has received some negative reward for collisions. For e cient learning, actions should thus be generated in such a way that the environment is explored and pain is avoided. This fundamental trade-o between exploration and exploitation demands e cient exploration capabilities, maximizing the e ect of learning while minimizing the costs of exploration.
منابع مشابه
Forecasting Industrial Production in Iran: A Comparative Study of Artificial Neural Networks and Adaptive Nero-Fuzzy Inference System
Forecasting industrial production is essential for efficient planning by managers. Although there are many statistical and mathematical methods for prediction, the use of intelligent algorithms with desirable features has made significant progress in recent years. The current study compared the accuracy of the Artificial Neural Networks (ANN) and Adaptive Nero-Fuzzy Inference System (ANFIS) app...
متن کاملThe use of wavelet - artificial neural network and adaptive neuro fuzzy inference system models to predict monthly precipitation
Precipitation forecasting due to its random nature in space and time always faced with many problems and this uncertainty reduces the validity of the forecasting model. Nowadays nonlinear networks as intelligent systems to predict such complex phenomena are widely used. One of the methods that have been considered in recent years in the fields of hydrology is use of wavelet transform as a moder...
متن کاملA Comparative Study of the Neural Network, Fuzzy Logic, and Nero-fuzzy Systems in Seismic Reservoir Characterization: An Example from Arab (Surmeh) Reservoir as an Iranian Gas Field, Persian Gulf Basin
Intelligent reservoir characterization using seismic attributes and hydraulic flow units has a vital role in the description of oil and gas traps. The predicted model allows an accurate understanding of the reservoir quality, especially at the un-cored well location. This study was conducted in two major steps. In the first step, the survey compared different intelligent techniques to discover ...
متن کاملAdaptive fuzzy sliding mode and indirect radial-basis-function neural network controller for trajectory tracking control of a car-like robot
The ever-growing use of various vehicles for transportation, on the one hand, and the statistics ofsoaring road accidents resulting from human error, on the other hand, reminds us of the necessity toconduct more extensive research on the design, manufacturing and control of driver-less intelligentvehicles. For the automatic control of an autonomous vehicle, we need its dynamic...
متن کاملModeling of streamflow- suspended sediment load relationship by adaptive neuro-fuzzy and artificial neural network approaches (Case study: Dalaki River, Iran)
Modeling of stream flow–suspended sediment relationship is one of the most studied topics in hydrology due to itsessential application to water resources management. Recently, artificial intelligence has gained much popularity owing toits application in calibrating the nonlinear relationships inherent in the stream flow–suspended sediment relationship. Thisstudy made us of adaptive neuro-fuzzy ...
متن کاملDesign and Simulation of Adaptive Neuro Fuzzy Inference Based Controller for Chaotic Lorenz System
Chaos is a nonlinear behavior that shows chaotic and irregular responses to internal and external stimuli in dynamic systems. This behavior usually appears in systems that are highly sensitive to initial condition. In these systems, stabilization is a highly considerable tool for eliminating aberrant behaviors. In this paper, the problem of stabilization and tracking the chaos are investigated....
متن کامل